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Amendments to the Claims 

Please cancel Claims 3 and 9. Please amend Claims 1, 7, 13 and 21. Please add new 
Claims 29 and 30. The Claim Listing below will replace all prior versions of the claims in the 
application: 

Claim Listing 

1 . (Currently amended) A method for determining a plurality of hypothetical matches to a 
spoken input, comprising the computer-implemented steps of: 

detecting sub word units in the spoken input to generate a first set of hypothetical 
matches to the spoken input; 

detecting words in the spoken input to generate a second set of hypothetical 
matches to the spoken input; and 

combining the first set of hypothetical matches with the second set of hypothetical 
matches to produce a combined set of hypothetical matches to the spoken input, the 
combined set having a predefined number of hypothetical matches, and wherein the 
combined set of hvpothetical matches is an ordered list comprising a highe st ranking 
hvpothetical match in the second set of hvpothetical matches, follo wed bv an ordered set 
of hvpothetical matches based on the first set of hvpothetical matches . 

2. (Original) The method of Claim 1 , wherein the step of detecting subword units includes 

detecting the subword imits in the spoken input based on an acoustic model of the 
subword units and a language model of the subword units; 

generating pattern comparisons between (i) an input pattern corresponding to the 
subword units in the spoken input and (ii) a source set of reference patterns based on a 
pronunciation dictionary, each generated pattern comparison based on the input pattern 
and one of the reference patterns; and 

generating the first set of the hypothetical matches by sorting the source set of 
reference patterns based on a closeness of each reference pattern to correctly matching the 
input pattern based on an evaluation of each generated pattern comparison, each 
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evaluation determining a word pronunciation distance measure that indicates how close 
each input pattern is to matching each reference pattern. 

3. (Cancelled) 

4. (Original) The method of Claim 1, wherein the combined set of hypothetical matches is 
an ordered list based on ranking confidence levels for each hypothetical match. 

5. (Original) The method of Claim 1 , wherein the subword imits include at least one 
phoneme. 

6. (Original) The method of Claim 1, wherein the hypothetical matches are words. 

7. (Currently amended) A computer system for determining a plurality of hypothetical 
matches to a spoken input, comprising: 

a subword decoder for detecting subword units in the spoken input to generate a 
first set of hypothetical matches to the spoken input; 

a word decoder detecting words in the spoken input to generate a second set of 
hypothetical matches to the spoken input; and 

a list fusion module for combining the first set of hypothetical matches with the 
second set of hypothetical matches to produce a combined set of hypothetical matches to 
the spoken input, the combined set having a predefined number of hypothetical matches^ 
and wherein the combined set of hvpothetical matches is an ordered list comprising a 
highest ranking hvpothetical match in the second set of hypothetical matches^ followed bv 
an ordered set of hvpothetical matches based on the first set of hvpothetical matches . 



8. 



(Original) The computer system of Claim 7, wherein the subword decoder 

detects the subword units in the spoken input based on an acoustic model of the 
subword units and a language model of the subword units; 
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generates pattern comparisons between (i) an input pattern corresponding to the 
subword units in the spoken input and (ii) a source set of reference patterns based on a 
pronunciation dictionary, each generated pattern comparison based on the input pattern 
and one of the reference patterns; and 

generates the first set of the hypothetical matches by sorting the source set of 
reference patterns based on a closeness of each reference pattern to correctly matching the 
input pattern based on an evaluation of each generated pattern comparison, each 
evaluation determining a word pronunciation distance measure that indicates how close 
each input pattern is to matching each reference pattern. 

9. (Cancelled) 

10. (Original) The computer system of Claim 7, wherein the combined set of hypothetical 
matches is an ordered list based on ranking confidence levels for each hypothetical match. 

1 1 . (Original) The computer system of Claim 7, wherein the subword units include at least 
one phoneme. 

12. (Original) The computer system of Claim 7, wherein the hypothetical matches are words. 

13. (Currently amended) A computer program product comprising: 

a computer usable medium for determining a plurality of hypothetical matches to 
a spoken input; and 

a set of computer program instructions embodied on the computer useable 
medium, including instructions to: 

detect subword units in the spoken input to generate a first set of hypothetical 
matches to the spoken input; 

detect words in the spoken input to generate a second set of hypothetical matches 
to the spoken input; and 
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combine the first set of hypothetical matches with the second set of hypothetical 
matches to produce a combined set of hypothetical matches to the spoken input, the 
combined set having a predefined number of hypothetical matches , and the combined set 
being an ordered list comprising a highest ranking hypothetical match in the second set of 
hypothetical matches, followed by an ordered set of hypothetical matches based on the 
first set of hypothetical matches . 

14. (Original) A method for determining a plurality of hypothetical matches to a spoken input 
by detecting subword units in the spoken input, comprising the computer-implemented 

steps of: 

detecting the subword units in the spoken input based on an acoustic model of the 
subword units and a language model of the subword imits; 

generating pattern comparisons between (i) an input pattern corresponding to the 
subword units in the spoken input and (ii) a source set of reference patterns based on a 
pronunciation dictionary, each generated pattem comparison based on the input pattern 
and one of the reference patterns; and 

generating a set of the hypothetical matches by sorting the source set of reference 
pattems based on a closeness of each reference pattem to correctly matching the input 
pattem based on an evaluation of each generated pattem comparison, each evaluation 
determining a word pronunciation distance measure that indicates how close each input 
pattem is to matching each reference pattem. 

15. (Original) The method of Claim 14, wherein the pattem comparisons are based on a 
confiision matrix that stores the likelihood of confusion between pairs of subword units, 
the likelihood of deleting each subword unit, and the likelihood of inserting each subword 
unit. 

16. (Original) The method of Claim 15, further comprising a step of training the confusion 
matrix based on an output of an subword decoder, the output produced from an acoustic 
input of a training data set input to the subword decoder. 
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1 7. (Original) The method of Claim 1 5, further comprising the step of computing the 
confusion matrix by determining an entry in the confusion matrix for each unique 
subword unit that is in the set of reference patterns. 

1 8. (Original) The method of Claim 14, wherein the step of detecting subword units is 
performed in a client computer, and the steps of generating pattern comparisons and 
generating the set of hypothetical matches are performed in a server computer. 

19. (Original) The method of Claim 14, wherein the step of generating the set of hypothetical 
matches includes: 

determining pairs of subword units by pairing an input subword unit from the 
input pattern with a reference subword unit from the reference pattern; and 

providing the word pronunciation distance measure by calculating a distance 
metric for each pair of subword units, the distance metric defined as follows: 



S(po,do) = 0 

S{pi - 1, flSf - 1) + Csubs{pi, dj) 
S(ph dj) = mini S(pi - 1, dj) + Cdei{pi) 

S(pi,dj-l) + Cins(d/) 
5(P, D) = S(pn, dm) + LP(pn, dm) 



wherein: 

S(P,D) is a distance between word P and D; P is a given input pattern, and D, a 
given reference pattern; 

S(Pi,dj) is a score of the given input pattern matching a given subword unit of P, 
and a given subword unit of D; 

Csubs(Pi^dj) is a cost of substituting the given subword unit of P vdth the given 
subword unit dj of D; 
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Cdei(p) IS a cost of deleting the given subword unit p; of P; 

Cins(d) is a cost of inserting the given subword unit dj of D; 

LPQ}^4m) is a length penalty of the given input pattern p„ matching the given 
reference pattern d^,n is the length of P, and m is the length of D; 

S(Pi^j,djJ has a value of zero (0) if Pi,j,dj.j is undefined; 

S(Pi_j,dj) has the value of zero (0) if ^ is undefined; 

S(Pi,dj,j) has the value of zero (0) if Pi^dj,^ is undefined; and 

the distance metric for each pair of subword units is calculated in a sequence such 
that S(Pi.i4jJ. S(Pi,i,dj), and S(Pi,djJ are determined previously to determining S(Pi,d). 

20. (Original) The method of Claim 14, wherein the subword units include at least one 
phoneme. 

21 . (Currently amended) [[An]] A computer system for determining a plurality of 
hypothetical matches to a spoken input by detecting subword units in the spoken input, 

comprising: 

a subword decoder for detecting the subword units in the spoken input based on 
an acoustic model of the subword imits and a language model of the subword units; and 

a subword detection vocabulary look up module for generating pattern 
comparisons between (i) an input pattern corresponding to the subword units in the 
spoken input and (ii) a source set of reference patterns based on a pronunciation 
dictionary, each generated pattern comparison based on the input pattern and one of the 
reference patterns; 

the subword detection vocabulary look up module generating a set of the 
hypothetical matches by sorting the source set of reference patterns based on a closeness 
of each reference pattem to correctly matching the input pattern based on an evaluation of 
each generated pattem comparison, each evaluation determining a word pronunciation 
distance measure that indicates how close each input pattem is to matching each reference 
pattem. 
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22. (Original) The computer system of Claim 21, wherein the pattern comparisons are based 
on a confusion matrix that stores the likelihood of confusion between pairs of subword 
units, the likelihood of deleting each subword unit, and the likelihood of inserting each 
subword unit. 

23. (Original) The computer system of Claim 22, wherein the confusion matrix is trained on 
an output of the subword decoder, the output produced from an acoustic input of a 
training data set to the subword decoder. 

24. (Original) The computer system of Claim 22, wherein the confusion matrix is based on 
determining an entry in the confusion matrix for each unique subword unit that is in the 
set of reference pattems. 

25. (Original) The computer system of Claim 21 , wherein the subword decoder is part of a 
client computer, and the subword detection vocabulary look up module is part of a server 
computer. 

26. (Original) The computer system of Claim 21, wherein the subword detection vocabulary 
look up module 

determines pairs of subword units by pairing an input subword unit from the input 
pattern with a reference subword unit from the reference pattern; and 

provides the word pronunciation distance measure by calculating a distance metric 
for each pair of subword units, the distance metric defined as follows: 
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S(po,dQ) = 0 

S(pi -\ydj-\) + Csubs{pi, dj) 
S{pi,dj) = min< -\ydj) + Cdei^pi) 
S{pi,dj-\) + Cim{dj) 

S(P,D) = Sipn.dm) + LP(pn,dm) 



wherein: 

S(P,D) is a distance between word P and D; P is a given input pattern, and D, a 
given reference pattern; 

S(Pi,dj) is a score of the given input pattern matching a given subword unit /?, of P, 
and a given subword unit dj of D; 

Csubs(Pi>d) is a cost of substituting the given subword unit A of P with the given 
subword unit dj of D; 

Cdei(p) is a cost of deleting the given subword unit p^ of P; 

Ci^(d^ is a cost of inserting the given subword unit dj of D; 

LP(p„,dJ is a length penalty of the given input pattern p„ matching the given 
reference pattern d^,n is the length of P, and m is the length of D; 

S(Pi.i,dj_j) has a value of zero (0) if A.y^^-^ undefined; 

S(Pi.i.dj) has the value of zero (0) if p^_j,dj is undefined; 

S(ppdjJ has the value of zero (0) if (Pi4j.i) is undefined; and 

the distance metric for each pair of subword units is calculated in a sequence such 
that S(Pi,],djJ, S(Pi,j,dj), and S(Pi,djJ are determined previously to determining S(Pi,dj). 

27. (Original) The computer system of Claim 21, wherein the subword units include at least 
one phoneme. 



28. 



(Original) A computer program product comprising: 



a computer usable medium for determining a plurality of hypothetical matches to 
a spoken input by detecting subwords in the spoken input; and 

a set of computer program instructions embodied on the computer useable 
medium, including instructions to: 

detect the sub word units in the spoken input based on an acoustic model of the 
subword units and a language model of the subword imits; 

generate pattern comparisons between (i) an input pattern corresponding to the 
subword units in the spoken input and (ii) a source set of reference patterns based on a 
pronunciation dictionary, each generated pattern comparison based on the input pattern 
and one of the reference patterns; and 

generate a set of the hypothetical matches by sorting the source set of reference 
patterns based on a closeness of each reference pattern to correctly matching the input 
pattern based on an evaluation of each generated pattern comparison, each evaluation 
determining a word pronunciation distance measure that indicates how close each input 
pattern is to matching each reference pattern. 

(New) A method for determining a plurality of hypothetical matches to a spoken input, 
comprising the computer-implemented steps of: 

detecting subword units in the spoken input to generate a first set of hypothetical 
matches to the spoken input; 

detecting words in the spoken input to generate a second set of hypothetical 
matches to the spoken input; and 

combining the first set of hypothetical matches with the second set of hypothetical 
matches to produce a combined set of hypothetical matches to the spoken input, the 
combined set having a predefined number of hypothetical matches, wherein the step of 
detecting subword units includes 

detecting the subword units in the spoken input based on an acoustic model of the 
subword imits and a language model of the subword units; 

generating pattern comparisons between (i) an input pattem corresponding to the 
subword units in the spoken input and (ii) a source set of reference patterns based on a 
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pronunciation dictionary, each generated pattern comparison based on the input pattern 
and one of the reference patterns; and 

generating the first set of the hypothetical matches by sorting the source set of 
reference patterns based on a closeness of each reference pattern to correctly matching the 
input pattern based on an evaluation of each generated pattern comparison, each 
evaluation determining a word pronimciation distance measure that indicates how close 
each input pattern is to matching each reference pattern. 

(New) A computer system for determining a plurality of hypothetical matches to a spoken 
input, comprising: 

a subword decoder for detecting subword units in the spoken input to generate a 
first set of hypothetical matches to the spoken input; 

a word decoder detecting words in the spoken input to generate a second set of 
hypothetical matches to the spoken input; and 

a list fiision module for combining the first set of hypothetical matches with the 
second set of hypothetical matches to produce a combined set of hypothetical matches to 
the spoken input, the combined set having a predefined number of hypothetical matches, 

wherein the subword decoder: 

detects the subword vmits in the spoken input based on an acoustic model of the 
subword xmits and a language model of the subword units; 

generates pattern comparisons between (i) an input pattern corresponding to the 
subword units in the spoken input and (ii) a source set of reference patterns based on a 
pronunciation dictionary, each generated pattem comparison based on the input pattern 
and one of the reference patterns; and 

generates the first set of the hypothetical matches by sorting the source set of 
reference patterns based on a closeness of each reference pattem to correctly matching the 
input pattem based on an evaluation of each generated pattem comparison, each 
evaluation determining a word pronunciation distance measure that indicates how close 
each input pattem is to matching each reference pattem. 



